LOS ANGELES SCIENTIFIC CENTER 
G320-2703, March 1980 


COMPUTER-AIDED MODELING AND ANALYSIS TECHNIQUES 


FOR DETERMINING MANAGEMENT 
INFORMATION SYSTEMS REQUIREMENTS 


K. EWUSI-MENSAH 


1977 LOS ANGELES SCIENTIFIC CENTER REPORTS 


+G320-2684 March 1977 ; 
A. INSELBERG, Variable Geometry Cochlear Model 
at Low Input Frequencies (60 p.) 


+G320-2685 March 1977 
E. B. FERNANDEZ & C. WOOD, The Relationship 
between Operating System and Database Security: A 
Survey (20 p.) 


+G320-2686 April 1977. 
T. LANG, C. WOOD & E. B. FERNANDEZ, Database 
Buffer Paging in Virtual Storage Systems (24 p.) 


G320-2687 April 1977 
R. C. SUMMERS & E. B. FERNANDEZ, A System 
Structure for Data Security (41 p.) 


+G320-2688 June 1977 
7 T. LANG, E. NAHOURAITI, K. KASUGA & E. B. 
FERNANDEZ, An Architectural Extension for a 
Large Database Incorporating a Processor for Dick 
Search (30 p.) 


G320-2689 July 1977 
C. WOOD, E. B. FERNANDEZ & T. LANG, Minimi- 
zation of Demand Paging for the LRU Stack Model of 
: Program Behavior (21 p.) 


G320-2690 Seotumnber 1977 


B. DIMSDALE, A Geometric Optimization Problem 


(20 p.) 


+G320-2691 September 1977 
B. DIMSDALE, Convex Cubic Splines (32 ¢ p.) 


G320-2692 September 1977 | 
B. DIMSDALE, Convex Cubic Splines !I (18 p.) 


G320-2693 September 1977 a 
E. B. FERNANDEZ, H. KASUGA, Data Control in a 
Distributed Database System (26 p.) 


1978 LOS ANGELES SCIENTIFIC CENTER REPORTS 


G320-2694 March 1978 
S. JUROVICS, An Investigation of ‘the Minimization 
of Building Energy Load Through Optimization 
Techniques (28 p.) 


1978 LOS ANGELES SCIENTIFIC CENTER REPORTS 


G320-2695 November 1978 
L. LICHTEN, An Approach to Solving Surface 
Connectivity Problems in Computer-Aided Design (41 


p.) 


G320-2696 November 1978 
C. WOOD, R. C. SUMMERS, E. B. FERNANDEZ, 
Authorization in Multilevel Database Models (28 p.) 


1979 LOS ANGELES SCIENTIFIC CENTER REPORTS 


G320-2697 March 1979 
P. NEISWANDER, A Review of the National Bureau 
of Standards Loads Determination Program (NBSLD) 
(12 p.) 


G320-2698 March 1979 
C. WOOD & E. B. FERNANDEZ, Authorization ina 
Decentralized Database System (35 p.) 


G320-2699 June 1979 
S. A. JUROVICS, Solar Radiation Data, Natural 
Lighting, and Building Energy Minimization (20 p.) 


G320-2700 August 1979 
A. INSELBERG, An Integral Equation Arising in a 
Convective Heat (Mass) Transfer Problem Through a 
_ Boundary Layer (19 p.) 


G320-2701 September 1979 
G. J. SILVERMAN, D. W. LOW, Construction of 
Optimal Synthetic Weather Data by Convex Com- 
_ bination (14 p.) 


1980 LOS ANGELES SCIENTIFIC CENTER REPORTS 


G320-2702 March 1980 
K. EWUSI-MENSAH,. Criteria for Decomposing an: 
information System Into Its Subsystems for Business 
Systems Planning (26 p.) 


G320-2703 March 1980 
K. EWUSI-MENSAH, Computer-Aided Modeling and 
Analysis Techniques for Determining Management 
Information Systems Requirements (30 p.) 


G320-2875 January 1979 
Compiled by S. E. MATOS, Abstracts of Los Angeles 
Scientific Center Reports 


~ The availability of reports is correct as of the printing date of this report. 


+ Appeared in an outside publication. Not available in Scientific Center report form. Please refer to the list of outside publications on 


~ the inside back cover for availability of reprints. 


-@ Copies of report are no longer available from the Scientific Center. 


LOS ANGELES SCIENTIFIC CENTER REPORT NO. G320-2703 


MARCH 1980 


COMPUTER-AIDED MODELING AND ANALYSIS TECHNIQUES FOR 


DETERMINING MANAGEMENT INFORMATION SYSTEMS REQUIREMENTS 


KWEKU EWUSI - MENSAH 


IBM CORPORATION 
Los Angeles Scientific Center 
9045 Lincoln Boulevard 
Los Angeles, California 90045 


ABSTRACT 


This paper deals with the problem of determining the information 
requirements or needs of a business organization in the development of 
information systems. Solution characteristics of the problem are discussed 
and several forms of models are described based on concepts in elementary 
matrix algebra and graph theory. We construct boolean matrices representing 
either data usage or creation by a process or function in a businss 
organization. Products of the two matrices are represented as directed 
graphs to depict the flow of information in the business. Two procedures are 
described for realizing the solutions obtained from the analysis. The 
models' applicability to other related areas Of information systems research 


is discussed and an example is presented from an analysis of sample data. 


Introduction 


The proliferation of information processing systems in business 
organizations due to the widespread availability of computers has seriously 
suffered from lack of comprehensive analysis of the information needs or 
requirements of management. The insufficient attention given to this 
important area of information systems development has undoubtedly resulted in 
the failure of many information systems to meet the high expectations of both 
the designers and users [1, 10]. Some attempts are currently in progress to 
correct this defect in information systems work as evidenced by recent 


publications[ see 2, 3, 5, 6, 13, 14, 16, 17 and 18]. 


The failure can be partly attributed to insufficient research into what 
Langefors [10] has characterized as the "infological problem' in information 
systems. The infological problem can be described as the problem of 
determining 'what information the system is to provide to satisfy the needs 
of its users.' This is because as Langefors rightly states 'an information 
system is well designed only if it provides the right information in kind, 
quality and time when it is needed.' This can be accomplished by knowing 
what the object system is in which the information system is embedded and 
also what are the needs or uses for information of each function or subsystem 
of the object system. The essence of the problem then is to construct a 
methodology which will provide information systems analysts and designers 


with conceptual models which describe in sufficient detail the information 


needs and problems of the business organization. 


One such methodology is IBM's Business Systems Planning (BSP). BSP is a 
eeoceduce for analyzing the information requirements of a business 
organization and developing an information systems architecture which will 
meet those needs. In the BSP approach to the analysis of the information 
requirements of management, both the top-down and bottom-up techniques are 
used[ 4]. The top-down approach is characterized by its focus on decisions 
at the managerial level of the organization [12]. The method involves 
analyzing the objectives of the business and the various processes or 
functions established to achieve those objectives. Finally the decisions 
necessary to manage those functions or processes are identified together with 
the necessary information required to support those decisions. The required 
information is in turn dependent on certain Siacdes or types of data. The 
bottom-up approach is characterized by its focus on data at the operational 
level of the organization [13]. The information requirements, in this case, 
are determined by examining all the data classes which management currently 
draws upon for decision-making. Finally an information systems plan is 
proposed based on the results of the analysis. Thus, in the analysis, design 
and implementation of the information requirements of an organization, one 


must perforce deal with data in a fundamental way. 


In this paper, we develop some computer-assisted techniques using boolean 
matrices and graph theoretical concepts to aid in the information 
requirements analysis for BSP. Our intent is not only to provide’ some 
analytical techniques currently lacking in BSP studies, but also to minimize 


the dependence of the results of the analysis on the composition of the BSP 


team. In this way, we hope to enhance both the consistency and reliability 
of the results of the study team's analysis. However, before we proceed with 
a discussion of the techniques, we digress briefly to discuss the essential 
characteristics of the information requirements analysis problem for a 


business organization. 


Characteristics of the Requirements Analysis 


Our approach is motivated by the need to find a requirements model which 
lends itself to theoretical analysis. With this in mind, we reconstruct 
BSP's matrices into boolean matrices which describe the relationships among 
vanicue data classes occurring in a business organization and the operations 
or processes the business undertakes to achieve its goals. Another reason 
. justifying this approach is that in the management of information, the 
information analyst's major concern is how to _ structure the data or 
information required by the organization [15 } The matrix model assumes a 
form in which individual data classes appear as one dimension (e.g. rows) and 
business operations or processes as the other (e.g. columns) [4 } The binary 
entries indicates either data usage or creation (denoted by 'l') of a data 
class by a business operation or process and zeros denoting non-usage or 
non-creation of a data class by a business operation. This concise way of 
structuring the organizational information makes it amenable for storage and 


theoretical analysis even for very large business organizations. 


The important considerations behind this approach to the collection and 
analysis of the business data from the perspective of the information analyst 


can be briefly characterized as follows: 


1) The approach must provide a means for the users of the information system 
to express their information requirements in terms understandable to them 
based on data available in their environment, that is both internal and 


external to the organization. 


2) The methodology must be able to express the requirements of the 
information systems in terms meaningful to the information systems 
designers. This is essential if the results of the requirements analysis 
are to serve as a valuable source of information to the systems designer 


in meeting the information needs of the user community. 


3) The results of the data collection and analysis must be stated in terms of 
those properties of the organization and the information users which are 
invariant. This is to enable the development of stable systems which can 
Meet the changing information requirements of the organization or 


enterprise as its operating conditions change. 

In the next section, we discuss what we consider to be the most desirable 
attributes of a tool consonant with the characteristics of the requirements 
analysis discussed above. 

Desirable Attributes 


Desirable attributes of the modeling techniques include the following: 


1) Analytical Capabilities - The modeling techniques must provide analytical 


2) 


3) 


capabilities for performing a comprehensive analysis on the data gathered 
from a study of the business' information needs. This is essential if the 
model is to be able to adequately cope with the complexity of the problem 
and the large volume of data involved. This feature will also enable the 
analysts to use the computational power of the computer to gain insights 
into the business information needs and problems which might escape them 


in a manual analysis. 


Flexibility, Stability and Reliability - The models must be stable, 
reliable, easy to maintain and flexible. By flexibility, we refer to the 
ability of the models to respond to changing business conditions imposed 
upon them by the business environment, without adversely impacting on the 
models' stability. Stability, on the other hand, refers to the capacity 
of the models to remain in equilibrium. The two - flexibility and 
stability - are interrelated in the sense that amodel may be atanne 
because it is flexible enough to meet the changing demands of the business 
environment and thus preserve its equilibrium. The models' reliability 
deals with the accuracy and consistency of the results obtained from the 
analysis. Stable and reliable models will enhance the credibility of the 
analysis results and minimize the undue dependence of the results of the 
data collection and analysis of the business' information needs on the 


imagination of the analysts involved with the study. 


Interactive Features - The modeling system must be easy to operate in an 
interactive mode to accommodate multiple analysts working on different 
aspects of the information requirements problem. This capability will 


encourage the analysts to develop other procedures to operate on the data, 


4) 


analyze them and arrange them in a suitable form for inquiry purposes 
[15], The feature will also permit easy modifications to be carried out 
in the course of the analysis and also to encourage sensitivity analysis 


of the results under different business environmental constraints. 


Inquiry Capabilities - This is a necessary feature to enable the analysts 
to interact effectively with the system. The power of this feature is to 
enable the analyst to selectively use the information stored in the 
computer to answer queries related to the information needs and structure 
of the business under study. In this vein procedures can be developed to 
locate any relevant information which may be requested. This will in 


effect put the memory capabilities of the computer to significant use in 


the analysis. 


These are the four main attributes we consider most desirable for the 
modeling system to possess in analyzing the information ecquiremente of a 
business organization. We are currently working on the last two 
attributes which we hope to incorporate in the final system. The 
remainder of the paper will therefore be devoted to discussions on the 


first two attributes described above. 
The Matrix Model 


The primary advantage of the matrix form of documentation and analysis 
used in the model is that it permits the analyst to describe the nature of 
the relationship between the object types (e.g. data and process) and to 


make inferences about the various relationships associated with the object 


types [8, 10 ]. 


Notation: 


We define the following terms used in the formulation of the model. 


Data Usage Matrix: A , as illustrated in Figure l. 
u 


1 If data d is used by process 


i 
A = (a..) where a... = p 
u ij ij ; 
) Otherwise 
mT 1 If process p uses data d 
A = (a...) 7; a,. = J 1 
u ji ji 
@) Otherwise 


And A! is the transpose of A . 
u u 


Data Creation Matrix: A , as illustrated in Figure 2. 


Cc 
1 If data d, is created/derived 
i 
= ° ' 
a (a ij? : = ij = by process P, 
0) Otherwise 
}1 If process p. creates/derives 
J 
A = ae ; ' 
2 (a ji a fe data d 
@) Otherwise 


ae 8 ig 
Similarly A. is the transpose of A. : 


Structure of Information Flow: 


Let Aa = ya'.ia, = (q ), as illustrated in Figure 3. 
cu i ji ik yk 
1 If and only if process p,. 
J 
creates any data d_ which 
where we define ae Pa | i 
is used by process P 
Otherwise 
Similarly A My =h a,,a', = (@ ), as illustrated in Figure 4. 
uc J 1) jk 1k 
1 If and only if data d, is 
i 
' used by an rocess p. 
‘where we define a ., 2 a a P, 


1k 
to create/derive data q 


) Otherwise 


In graph theoretical terms, the matrix ( 4) (or ( w) describes the 
existence and number of paths from one process (or data) to another 


process (or data) through any data (or process), (see for example [8]). 


Our objective inthe analysis is to construct the flow of information 
(e.g. data and process) through a business organization by establishing a 
link between the data vs. data (i.e. AA One a! matrix and the 
process vs. process (i.e. n Bee * 5% matrix. For visual impact, we use 
a flowchart to represent the structure of the information flow as a 


directed graph. The flow progresses from left to right. 


t ; 
From the data vs. data matrix (a) we obtain the following representation 
i 


which corresponds to Figure 4, with the circles and boxes representing 


data and process respectively. 


PROCESS 
i 


Fig. 5 


And from the process vs. process  ( O11) matrix we get the following 


representation, which corresponds to Figure 3. 


PROCESS PROCESS 
a i Il 
A B 


Fig. 6 


Using the results of the analysis illustrated by Figures 5 and 6, we can 
construct the flow of information (i.e. data and process) for the entire 


organization as shown below; 


PROCESS PROCESS | 
a. ci 
PROCESS 
N-1 


Fig. 7 


We now describe an algorithm which enables us to construct the information 
flow illustrated in Figure 7 above, using the information contained in 


Figures 5 and 6. 
Data/Process Flow Procedure 


The objective in this procedure is to construct the flow of information 
through the business organization by showing the transformations that each 
data class undergoes through interactions with business processes or 
functions till the final data class is realized. Any missing data class 
may thus be uncovered in the procedure, if, for example, a process accepts 
as input a data class but does not produce another data class as output. 
Such discoveries will be significant in identifying the sources of some of 
the organization's information needs and problems. We describe below the 
main steps of a procedure for generating the data-process flow paths. 

i 
T) Begin If OL 5 is not empty, Then 


Set: Data I' = Data Ih 
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Else 


End Procedure; 


II) Compare Processes I and TI. 


if Process I = Process II, Then 
Set: Process I' = Process at 


Go to Step III; 


Else 


If preceding step is I, Then 
: a > 
Output: Data I, Process I Data I. 


Return to Step I and look for new Data - 
Else 
Output the results and pop-up stack P 
Return to Step V and continue to look for 
new Data I and Process I" which follows 


current value of a... 
ik 
TII) For the same Processes I and Ht, 
ompar ata I d tI 
Cc pare D B an I 
If Data I, = Data II Then 
Set: Data II' = Data II 
Go to Step IV; 
Else 
If preceding Steps are I and II Then 
: a > r 
Output Data I, Process I Data 


B 


Return to Step I and look for new Data _ 
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Else 
Output the results and pop-up stack P 
Return to Step V and look for new 


Data I," and Process I" which follows the 


a ' 
current value of ik 


IV) Compare Processes II, and II, 
If Process II = Process II Then 
ce A B 
Update: Set: Data N = Data II' = Data II 
(That is, Data II updates itself, Data II = Data a) 
Output the Result 
Return to Step II and look for new Process os 
which follows current value of a 
Else 
Set: Process II' = Process II 
push-down Stack D (to save current 
location and value of a ys 


Lk 


Continue with the search; 
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V) Compare Data I’, and Data II' (That is, new Data Ii 


if Data II' = Data rn and 
Process II' = Process I" Then 
Set: Data III' = Data ry 


Push-down Stack P (to save current location 


and value of a., ) 


jk 
(Note: Data II' = che ae and Process II'= Process I") 
Return to Step II; 
Else 
Set: final Data = '**' (That is, unknown) 


Output the results and pop-up Stack D 
If Stack D is empty Then 
Return to Step I 
Else 
Return to Step II and look for new Processs 
which follows current value of a... 


jk 


VI) Repeat Steps I thru V till the entire paths 


generated by the matrices a. and a, 
ik jk 


(That is, Data vs. Data, and Process vs. Process) 


have been used in constructing the Data-Process 
Flow paths for the entire organization 


End: Procedure; 


In order to save computational time during the actual program 
implementation of the above procedure the search is restricted to only two 


stacks of queues D and P, representing the data vs. data matrix 
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(i.e, oe ) and the process vs. process matrix (i.e., - ) respectively. 
This restriction on the ee eee appears to be a drawback on the 
procedure's effectiveness in enumerating all the possible information 
paths. However, extensive experimental results demonstrate conclusively 
that any information path omitted as a result of the restriction is later 
recovered when the information flow paths generated by the procedure are 
represented in a tree-structure form. The power of the tree structure 
representation of the information or data-process flow paths is discussed 


later on in the paper. 


It has also been observed from sample data analyzed that the flow paths 
generated by the search procedure do invariably contain some redundant 
information. This indicates that some information flow paths generated 
tage later found to be subsets of or contained in some other information 
flow paths. We describe below a procedure for eliminating all such 


redundant information contained in the information flow paths generated. 
Maximal Complete Subgraphs Procedure 

The procedure deals with generating maximal complete subgraphs of the 
data-process flow paths enumerated. Each data or process is treated as a 
node in a directed graph and the procedure essentially looks for a match 
between corresponding nodes (in the same sequence in which they appear) in 


any two data-process flow paths. 


Notation 


14 


th | | 
P, is the k. node of the complete subgraph 


(i.e. the data-process flow path). 
M is the total number of complete subgraphs 


i=1,2,3,...-, N is the total number of nodes 


h 


iG.) denotes the qh (rt®) complete subgraph 


for node i(}j), where q, fH 152735443 


Let C = {p 


| for k=1.,2;,3«<% 
i 2k+1 P22 (K+1) I Soe 


denotes the nodes in the complete subgraph c. 
i 


excluding nodes 1 and 2 


Similarly "sh = {Poy Poe } for k=1,2,3... 


denotes nodes in the complete subgraph T. 
J 


The procedure consists of two parts - Forwardtrack and 
Backtrack. In the former case we look for complete subgraphs 
which are subsets of complete subgraphs generated earlier. 
But in the latter case, we look for complete subgraphs which 
are contained in other complete subgraphs. We now describe 
the procedure for generating the maximal complete subgraphs 


of the data-process flow paths enumerated. 
Forwardtrack 


For each complete subgraph C, andT , 
1 J 
q xr 
Lf i. << Cc. then delete T 
r q Jy 
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(1) 


(2) 


This implies that for any pair C. 


TE Poggi = Pox] 


4 
Fo (k+l) 


Then AM 


We repeat the search till all pune re 


deleted from M and we are left with My 


J 


x 


1 
q 


for k=1,2,3,... 


~ Pox 
<c C 


q 


and we delete T 


J 


rE 


and T. 
J 


r 


from M 


r 


have been 


and C, 
J 


ts 


where M =™M - a TT 
J 
Tc. sg 
I et 
Backtrack: 
Let S ={p Pp 
in 2k+1 2 (k+1) 
for 1 mm N, N-1, N-2,... 
and C. = {p 
ae’ Py OF! 
Yr 
In this case we look for complete subgraphs S. 
ef 
such that Cc. > S. 
5 1 
r g 
That is, if = 
+S 25 Bx ~ P one) 
for k=1, 2,3 ).« << 
and p = Pp 
2 (k+1) 
Then C, > S. and we delete C. from M 
Yr gq J, 
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(3) 


We repeat the procedure till all subsets C. have been 
J 


r 
deleted from M and we are left with M 
1 max 
where M =M - a On (4) 
max 1 5 
xr 
C.>Ss. 
J~ 1 
req 


Upon termination of both the Forwardtrack and Backtrack procedures, all 
complete subgraphs ane remaining out of the initial M will be maximal and 
will therefore constitute the maximal complete subgraphs of the 


data-process flow paths enumerated. 


That is, M =M - > TT, - » Cc, 
max J J 
r Yr 
Tee: Cc 2S. 
J. 1 J. 1 
req reg 


Data/Process Flow Analysis: An Example 


We now illustrate with examples taken from sample data (see Figures 1, 2, 
3 and 4) analyzed using the procedures described above. Examples of the 
type of information (i.e. data-process) flow paths represented by Figures 


5 and 6 are; 


Data vs. Data Matrix Process vs. Process Matrix 


a) ay) 
1-4-6 3-2-7 
1-4-7 3-2-8 
1-4-8 3-2-4 
1-4-10 3-2-1 
2-4-6. 3-2-2 
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2-9-13 | 4-6-12 
3-12-15 4-10-12 


3-4-7 4-8-5 
Fig. 8 


A sample of the data-process flow paths enumerated as in Figure 7 is as 


shown in Figure 9 below. 


1-4~6-12-15-33-36 
1-4-6-12-17-33-36 
1-4-6-5-3-12-15-33-36 
1-4-6-5-4-12-15-33-36 
1-4-6-5-5-12-15-33-36 
1-4-7-12-15-33-36 
1-4-7-12-17-33-36 
1-4-7-5-3-12-15-33-36 
1-4-7-5-4-12-15-33-36 


1-4-7-5-5-12-15-33-36 
Fig. 9 


Each information or data process flow path enumerated above can be 
considered as a sentence in predicate calculus [15]. This type of concise 
representation makes it possible to store vast amounts of information ina 
readily usable form. The numbers in the odd-numbered positions (i.e., 


Lp Spd poss) designate data and those in the even-numbered positions 
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(i.e.,2,4,6,...) represent processes. By this representation significant use 
is made of the computer's power to store information which can be readily 
made available to the analysts through the use of a suitable inquiry 


language. The first two rows of Figure 9 read thus; 


1. Profit Projections ~*> 4. Financial Reporting ~* 6. Expenses 
Actual + 12. Product Cycle Control > 15. Product Tracking 
and 17. Product Development Statistics ~* 33. I/S 


Service Allocation > 36. Service Actuals. 


The information contained in the above representation, although 
sufficient, is regrettably inadequate in conveying to the analyst the degree 
of interactions among the various data classes and the business processes. 
In order to overcome this drawback, all the basic information flow paths 
generated by any data class are presented together in graphical form as an 
information or data-process tree. This type of representation not only 
clearly identifies all the major data classes or processes critical to the 
business, but also presents a better picture of the complexity of the 
interconnection among the data classes and the business processes. Another 
advantage of the tree structure representation is that it enables the analyst 
to recover any flow path missed in the original search procedure due to the 
fact that the search is limited to only two stacks of queues to. save 
computational time. Figure 10 is a_ tree representation of the information 


flow paths illustrated in Figure 9. 


Application to Other Areas 
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The information requirements modeling and analysis techniques presented 
here can be applied to other related areas of research in studying the 
information systems of business organizations. In this section, we will 


briefly discuss the areas of applicability. 


* Current Information Subsystems vs Business Process Matrix. 


The information system can first be decomposed into its subsystems [see 7 
for a discussion on this]. The construction of this matrix will show 
which business processes currently have their information requirements 
either partially or fully satisfied. The matrix will serve as a basis 
for identifying any deficiencies in the current information system and 


thus suggest possible areas of improvements. 
x Current Information Subsystems vs Data Classes Matrix. 


The purpose of this matrix is to show the nature and degree of 
interactions between the present information system of the business and 
the various data classes which serve as either inputs or outputs. The 
nature of the dependence of the information system on the data siaceee 
may also help to uncover some problems such as missing data for some 
subsystems or unnecessary dependence of a subsystem on other subsystems 


due to its association with unneeded data. 
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* Organization vs Business Process Matrix 


The intent of this matrix is to show how the business processes 
identified in a business relate to the various organizational units. The 
matrix also helps to reveal the responsibilities of the various 
organizational units with respect to the processes they interact with. 
This is critical if one needs to trace any problems associated with any 
information subsystem to the organizational units which directly or 


indirectly relates to those processes. 


x Information-Related Problems 


For arriving at a solution of a comprehensive up-to-date information 
system architecture which will meet the needs of an organization, it is 
important that a determination of the information-related problems (such 
as, timeliness, tardiness, accuracy, privacy and/or security of any 
data), associated with the current information system or business 
processes are clearly identified. Once this is accomplished, various 
matrices such as information problems vs. business processes, information 
problems vs. data classes, information problems vs. organizational units 


etc. can either be constructed or determined using other matrices. 


In the case of information problems vs data class matrix, the problems 
associated with any particular data class can be identified. So that proper 
solutions can be developed which will correct for any deficiencies which may 
exist in the current information system or the current methods of data 


collection and/or processing to produce the required information. 
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Conclusions 


We have described in this paper modeling and analysis techniques based on 
concepts in elementary matrix algebra and graph theory to deal with the 
information requirements problem. It is apparent from the discussions that 
each of ee nee described serves to inform the analyst about the nature 
or state of the organization with respect to information, the current 
information system, the business processes and the data classes on which the 
business' information requirements ultimately depend. Some redundancies are 
intentionally incorporated in the analysis as a necessary check to eliminate 
any inconsistencies or apparent inaccuracies in the construction of some of 
the matrices. The data collected can thus be validated before it is 


subjected to further analysis. 


The ultimate justification for this approach to the information 
requirements problem is that it provides a better understanding of the flow 
of information through a business organization. The techniques also serve 
the purpose of widening the scope and depth of the analysis by making 
available to the analysts the computational power and storage facilities of 
the computer in finding seit ions to management's information needs and 
problems. The insight obtained from the analysis through the use of the 
computer may point to the need for more information (e.g., in the case of 
missing data). The analysis may also help to expose unneeded data which may 
be the cause of unwarranted dependence among what otherwise should be 
logically independent information subsystems. Indeed it is the computer's 


power to store large amounts of basic information using the representations 


22 


discussed in the paper which may prove significant in the long run as the 
information systems evolves to meet the changing information needs of an 


evolving business organization. 


Acknowledgement 


It is my pleasure to gratefully acknowledge the excellent comments and 
suggestions by Rita Summers on an earlier version of this paper. Helpful 
discussions with Alex Hurwitz and J. Gene Sakamoto during the course of this 
research is also appreciated. The programs implementing the procedures 
described in the paper were written in APL with the assistance of Funn Young. 
Financial support for the project was provided by the Ad. Tech. Dept. in 


DPMG. 


References 


1. Ackoff, R.; "Management Misinformation Systems," 


Management Sci., Vol 14, No. 4, Dec. 1967, p. 147-156. 


2. Bally, L., J. Brittan and K. H. Wagner.; "A Prototype 
Approach to Information System Design and Development" 


Information and Management. Vol 1, 1977, p21-26. 


3. Bentley, T. J.; "Defining Management's Information 


Needs", Proc. AFIPS, Fall Joint Computer Conf., 


Vol 45, 1976, p. 869-876. 


23 


4. Business Systems Planning - Information Systems Planning Guide. IBM 


9E20-0527-2, White Plains, New York, October 1978. 


5. Chadler, E. W. and P. Nador.; "A Technique for Identification of User's 
Information Requirements, The First Step in Information System Design", 


IFIP Cong. Proc. Vol, 1971, p. 819-826. 


6. Davis, G.; MIS, Conceptual Foundation, Structures and Development, 


McGraw-Hill, New York, 1974. 


7. Ewusi-Mensah, K; "Criteria for Decomposing an Information System into its 
Subsystems for Business Systems Planning", IBM Los Angeles Scientific 


Center Report, March, 1979. 


8. Harary, F., R. &Z. Norman and D. Cartwright; Structured Models: An 


Introduction to the Theory of Directed Graphs, John Wiley, New York 1965. 


9. Homer, E. G.; "A Generalized Model for Analyzing Management Information 


Systems", Management Sci., Vol, July 1962, p. 500-516. 


10. Langefors, B.; "Information Systems", IFIP Cong. Proc. Vol, 1974, 
p.937-945. 
ll. Lieberman, I. Jd.; "A Mathematical Model for Analyzing Management 


Information Systems", Management Sci., Vol, July 1956, p. 327-336. 


12. Miller, J. C.; "Conceptual Models for Determining 


24 


13. 


14. 


LD 


16. 


17. 


18 


Information Requirements", Proc. AFIPS, Spring Joint 


Computer Conf., Vol 25, 1964, p. 609-620. 


Munro, Malcom C. and Gordon B. Davis; "Determining 
Management Information Needs: A Comparison of 


Methods", MIS Quarterly, June 1977, p. 55-66. 


Ross, D. T. and K. E. Schoman, Jr.; "A Structured 
Analysis for Requirements Definition", IEEE Trans. on 


Software Engineering, Vol. SE-3, No. 1, Jan. 1977, p. 6-15. 


Stamper, R.; Information in Business and Administrative Systems, Halstead 


ceestrce HU 6 SUCRE 


Press, John Wiley, New York 1973. 


Taggart, W. M. Jr. and M. O. Tharp; "A Survey of 
Information Requirement Analysis Techniques", Computing 


Surveys Vol 9, No. 4, Dec. 1977, p. 273-290. 


Taggart, W. M. Jr. and M. O. Tharp; "Dimensions of 
Information Requirements Analysis", Data Base Vol. 7, 


Sept. 1975, p. 5-13. 


Teichroew, D. and E. A. Hershey III; "PSL/PSA: A 
Computer-Aided Technique for Structured Documentation 
and Analysis of Information Processing Systems", 

IEEE Trans. on Software Engineering, Vol. SE-3, No. l, 


Jan. 1977, p. 41-48. 


25 


92 


"Ta 


T 


td NOaWDOoOSB 
=| V9 o£ OD 
6) HF noegona 
_ oO tats > 
O nM O FF: HH. 
Ke 35a 3B 
49) HO wQ oo o 
©) 0K mM ,Q 

K 3 ot 

hy ct O 

}- nM 

ct . 

O 

by 

a 


ssezborg ut 470M 


SOTITTTORVd 


“AUI Spooy 
"AUI [TeTTEIeEW MeY 


‘UTA 


XOpus/A 


TetzeqeEW JO [Ita 


Zerasew sqazeg 


qZonpoirg | 
Tetoueuty 


Hbutuuetg 


Compensation 


VLVd 


Sssaxoo0dd 


Business Planning 
Org. Analysis 
Review & Control 
Financial Plan. 
Capital Acquisit. 
Research 
Forecasting 
Design & Develop. 


Prod. Spec. Maint. 


Purchasing 
Receiving 
Inventory Control 
Workflow Layout 
Scheduling 
Capacity Plan. 
Material Regqmts. 
Operations 
Territory Mgmt. 
Selling 

Sales Admin. 
Order Servicing 
Shipping 
General Acct. 
Cost Planning 
Budget Acct. 
Personnel Plan. 
Recruit Develop. 


XTIULVWW dOWSN SSadoOdd-VLVd 


UT 


Zopus/ 
Tetazsj7eW FO TIitd 
qZonporg 


seAoTduy 
TeToOUeUT A 


QW 
¢ 0 
ME 
ct ct 
O p- 
3 5 
© Q 
KO 


8 
0) 
we) 
wv 
(D 
3 
ct 
wn 


Hutuuetd 


= ty 
gy ey 
Q Q 
=F ae 
b- t 
3 H- 
© ct 

f- 
ee 0) 
O ni 
Qu 


= 
O 
K 
~ 
a 
eo 
as, 
my 
O 
Q 
K 
© 
n 
wn 


Zaysew sRzeg 


"AUT SpPOOy 
}"AUI TeTrTSQeEW MEY 


69) 
re) 
= 
49) 
69) 
KH 
(D 
4 
a) 
J. 
ct 
O 
kK 
KG 


Le 
Ta 


WLW 


SSdHo08dd 


Business Planning 
Org. Analysis 
Review & Control 
Financial Plan. 
Capital Acquisit. 
Research 
Forecasting 
Design & Develop. 
Prod. Spec. Maint. 
Purchasing 
Receiving 
Inventory Control 
Workflow Layout 
Scheduling 
Capacity Plan. 
Material Regmts. 
Operations 
Territory Mgmt. 
Selling 

Sales Admin. 
Order Servicing 
Shipping 

General Acct. 
Cost Planning 
Budget Acct. 
Personnel Plan. 
Recruit Develop. 
Compensation 


XIUYLWW NOITLVaYO SSdo0ud-WLvd 


BE 


PROCESS VS. PROCESS MATRIX 


Process 


Process 


1234506789 101112 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 | 


es Business Plan 1 4 id 1 
2% Org. Analysis 

4 Review & Control 

4. Financial Plan ; es 1 1 

Dig Capital Acquis. a Ll 1 iol 1 
6. Research 

i Forecasting 

8. Design & Develop. ; ce 1 1 il 1 1 1 1 1 

9. Prod. Spec. Maint. 1 1 

10. Purchasing 1 1 114i 1 1 

ll. Receiving 

12. Inventory Control 1 1 

13. Workflow Layout 1 il 

14. Scheduling 1 1 1 1 

15. Capacity Plan 1 1 

16. Material Regmt. 1 1 

17. Operations 1 1 | 

18. Territory Mgmt. ps a 1 1 1 

19. Selling J) ne 

20. Sales Admin. 

21. Order Servicing y od, 1 1 

22. Shipping 

23. Gen. Accounting 

24. Cost Planning 1 1 1 

25. Budget Account 

26. Personnel Plan 1 1 1 1 1 


Recruit Delevop 
Compensation 


Fig. 3 


672 


ONO OB WN FE 


kK 
©. 


Data 
Data 


Planning 
Financial 
Product 

Parts Master 
Bill of Material 
Vendor 


Raw Material Inv. 


Fin. goods Inv. 
Facilities 

Work in Progress 
Machine Load 
Open Reqmts. 
Routings 
Customer 

Sales Territory 
Order 

Cost 

Employee 


DATA VS. DATA MATRIX 


Fig. 4 


456¢789 1011 


IZ 13 24 15-16 27°26 


A TREE STRUCTURE REPRESENTATION 
~ OF THE INFORMATION FLOW PATHS 


Legend : | | =Process 


t =Data 


Fig. 10 


30 


SCIENTIFIC CENTER REPORT INDEXING INFORMATION 


1.. AUTHOR(S) : 9. SUBJECT INDEX TERMS 


Kweku Ewusi-Mensah Information Systems 
Business Organizations 
Requirements Analysis 
Data Usage/Creation 


Process/Function 


. TITLE > Computer-Aided Modeling and Analysis 
Techniques for Determining Management | 


PR 1 = 
NTOrmMsa ole oils atwis eT | 


3. ORIGINATING DEPARTMENT 
Los Angeles Scientific Center 


4. REPORT NUMBER 
G320-2703 


5a. NUMBER OF PAGES 


30 
DATE COMPLETED 
March Eo7o 


5b. NUMBER OF REFERENCES 


18 


6b. DATE OF INITIAL PRINTING 


6a. 6c. DATE OF LAST PRINTING 


March 1980 


7. ABSTRACT : 


This paper deals with the problem of determining the information 
requirements or needs of a business orqanization in the development 
of information systems. Solution characteristics of the problem 

are discussed and several forms of models are described based on 
concepts in elementary matrix algebra and qraph theory. We construct 
boolean matrices representing either data usage or creation by a 
process or function in a business organization. Products of the 

two matrices are represented as directed graphs to depict the flow 
of information in the business. Two procedures are described for 
realizing the solutions obtained from the analysis. The models’ 
applicability to other related areas of information systems research 
is discussed and an example is presented frrm an analysis of sample 
data. 


8. REMARKS : 


1977 IBM LOS ANGELES SCIENTIFIC CENTER 
OUTSIDE PUBLICATIONS | 


T. LANG & E. B. FERNANDEZ, Improving the Com- 
putation of Lower Bounds for Optimal Schedules, 1BM 
Journal of Research & Development, Vol. 21, No. 3, 
May 1977, 273-280. 


A. INSELBERG, (G320-2684) Variable Geometry 
Cochlear Model at Low Input Frequencies: A Basis for 
Compensating Morphological Disorders, |BM Journal of 
Research & Development, Vol. 21, No. 5, September 
1977, 461-478. 


T. LANG, E. NAHOURAII, K. KASUGA, E. B. 
FERNANDEZ, An Architectural Extension for a Large 
Database System Incorporating a Processor for Disk 
Search, Proceedings of the 3rd International Conference 
on Very Large Data Bases, [EEE Computer Society, or 
ACM, Tokyo, 1977, 204-210. 


T. LANG, E. B. FERNANDEZ, R. C. SUMMERS, A 
System Architecture for Compile-Time Actions in Data- 
bases, ACM 77 Proceedings of the Annual Conference, 
Seattle, Washington, October 17-19, 1977, 11-15. 


E. B. FERNANDEZ & C. WOOD, (G320-2685) The 
Relationship Between Operating System and Database 
System Security: A Survey, Proceedings of COMPSAC 
77, 1st International Computer Software Applications 
Conference, IEEE Computer Society, Chicago, Ill., 
November 8-11, 1977, 453-462. 


T. LANG, C. WOOD & E. B. FERNANDEZ 
(G320-2686), Database Buffer Paging in Virtual Storage 
Systems, ACM Transactions on Database Systems, Vol. 
2, No. 4, December 1977, 339-351. 


1978 IBM LOS ANGELES SCIENTIFIC CENTER 
| OUTSIDE PUBLICATIONS 


B. DIMSDALE, (G320-2692) Convex Cubic Splines, 
IBM J. Res. Develop., Vol. 22, No. 2, March 1978, 
168-1 78. 


E. B. FERNANDEZ, T. LANG, C. WOOD, Effect of 
Replacement Algorithms on a Paged Buffer Database 
System, IBM J. Res. Develop., Vol. 22 No. 2, March 
1978, 185-196. 


A. INSELBERG, (G320-2669) Cochlear Dynamics: the 
Evolution of a Mathematical Model, Siam Review, Vol. 
20, No. 2, April 1978, 301-351. 


S. A. JUROVICS, Optimization Applied to the Design of 
an Energy Efficient Building, IBM Journal of Research 
and Development, Vol. 22, No. 4, July 1978, 378-385. 


E. B. FERNANDEZ, R. C. SUMMERS, T. LANG, & C. 
D. COLEMAN, (G320-2683) Architectural Support for 
System Protection and Database Security, IEEE Trans- 
actions on Computers, C-27, No. 8, August 1978, 
767-771. 


R. C. SUMMERS & E. B. FERNANDEZ, An Approach 
to Data Security, Proceedings of the 8th Australian 
Computer Conference, September 1, 1978. 


D. W. LOW, A Directed Weather Data Filter, IBM 
Journal of Research & Development, Vol. 22, No. 5, 
September 1978, 487-497. 


STEPHAN A. JUROVICS & DAVID W. LOW, Opti- | 
mizing the Passive Solar Characteristics of Buildings, 
Presented at the Winter Annual Meeting of ASME, San 
Francisco, California, December 10-15, 1978, 43-51. 


Les Angeles Scientific Center, 9045 Lincoin Boulevard, Los Arigeles, California 90045 


